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Abstract. This paper is a short summary of our recent work on the 
medians and means of probability measures in Riemannian manifolds. 
Firstly, the existence and uniqueness results of local medians are given. 
In order to compute medians in practical cases, we propose a subgradient 
algorithm and prove its convergence. After that, Frechet medians are con- 
sidered. We prove their statistical consistency and give some quantitative 
estimations of their robustness with the aid of upper curvature bounds. 
We also show that, in compact Riemannian manifolds, the Frechet medi- 
ans of generic data points are always unique. Stochastic and deterministic 
algorithms are proposed for computing Riemannian p-means. The rate 
of convergence and error estimates of these algorithms are also obtained. 
Finally, we apply the medians and the Riemannian geometry of Toeplitz 
covariance matrices to radar target detection. 

1 Introduction 

It has been widely accepted that the history of median begins from the following 
question raised by P. Fermat in 1629: given a triangle in the plan, find a point 
such that the sum of its distances to the three vertices of the triangle is minimum. 
It is well known that the answer to this question is: if each angle of the triangle 
is smaller than 27r/3, then the minimum point is such that the three segments 
joining it and the vertices of the triangle form three angles equal to 27r/3; in the 
opposite case, the minimum point is the vertex whose angle is no less than 27r/3. 
This point is called the median or the Fermat point of the triangle. 

The notion of median also appears in statistics since a long time ago. In 1774, 
when P. S. Laplace tried to find an appropriate notion of the middle point for a 
group of observation values, he introduced "the middle of probability" , the point 
that minimizes the sum of its absolute differences to data points, this is exactly 
the one dimensional median used by us nowadays. 

A sufficiently general notion of median in metric spaces was proposed in 1948 
by M. Frechet in his famous article [53], where he defined a p-mean of a random 
variable X to be a point which minimizes the expectation of its distance at the 
power p to X. This flexible definition allows us to define various typical values, 



among which there are two important cases: p — 1 and p — 2, corresponding to 
the notions of median and mean, respectively. 

Apparently, the median and mean are two notions of centrality for data 
points. As a result, one may wonder that which one is more advantageous? 
Statistically speaking, the answer to this question depends on the distribution 
involved. For example, the mean has obvious advantage over the median when 
normal distributions are used. On the contrary, as far as Cauchy distributions are 
concerned, the empirical mean has the same accuracy as one single observation, 
so that it would be better to use the median instead of the mean in this situation. 
Perhaps the most significant advantage of the median over the mean is that the 
former is robust but the latter is not, that is to say, the median is much less 
sensitive to outliers than the mean. Roughly speaking, in order to move the 
median of a group of data points to arbitrarily far, at least a half of data points 
should be moved. Oppositely, in order to move the mean of a group of data 
points to arbitrarily far, it suffices to move one data point. So that medians are 
in some sense more prudent than means, as argued by M. Frechet. The robustness 
property makes the median an important estimator in situations when there are 
lots of noise and disturbing factors. 

The first formal definition of means for probability measures on Riemannian 
manifolds was made by H. Karcher in [24 . To introduce Karcher's result con- 
cerning means, consider a Riemannian manifold M with Riemannian distance d 
and 

B{a, p) ^ {x e M : d{x, a) < p} 

is a geodesic ball in Af centered at a with a finite radius p. Let A be an upper 
bound of sectional curvatures in B{a,p) and inj be the injectivity radius of 
B(a,p). Under the following condition: 

where if Z\ < 0, then tt/ (4%/^) is interpreted as -|-cxi, Karcher showed that, with 
the aid of estimations of Jacobi fields, the local energy functional 

F^: B(a,p)^R+, x^f d^{x,p)p{dp) (2) 

J M 

is strictly convex, thus it has a unique minimizer which he called the 

Riemannian center of mass of the probability measure fx. Moreover, h{p) is also 
the unique solution of the following equation: 

eyiY>~^ p p{dp) =0x, xeB{a,p). (3) 

From then on, local means of probability measures on Riemannian manifolds 
are also called Karcher means, meanwhile, global means are often called Frechet 
means. A rather general result concerning the uniqueness of local means was 
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proved by W. S. Kendall in As a particular case of Kendall's result, the 
condition ^ 

p < - min { , inj } (4) 

is sufficient to ensure the uniqueness of the Kacher means of fi. 

Some generalizations of Karcher mean are given by many authors. For in- 
stance, M. Emery and G. Mokobodzki defined in (21] the exponential barycenters 
and convex barycenters for measures on affine manifolds. They also showed that 
a point a; is a convex barycenter of a probability fi if and only if there exists 
a continuous martingale starting from x with terminal law /x. The uniqueness 
of exponential barycenters are generalized by M. Arnaudon and X. M. Li in [3] 
to probability measures on convex affine manifolds with semilocal convex ge- 
ometry. Moreover, the behavior of exponential barycenters when measures are 
pushed by stochastic flows is also considered in [3]. In order to study harmonic 
maps between Riemannian manifolds with probabilistic methods, J. Picard also 
gave a generalized notion of barycenters in _27j. As we noted before, Karcher 
means are only local minimizers of the energy functional in ©, but it is 
easily seen that can be defined not only on the closed ball B{a, p) but also 
on the whole manifold M as long as the second moment of p, is finite. This 
leads to the global minimizers of the second moment function of /i, which is 
just the original definition of means made by Frechet. Global minimizers are 
more useful in statistics than local ones, so that it is necessary to know whether 
or under which conditions the Karcher mean of ^ is in fact the Frechet mean. 
For the case when /j, is a discrete measure supported by finitely many points in 
the closed upper hemisphere, S. R. Buss and J. P. Fillmore showed in [17 that 
if the support of /i is not totally contained in the equator then has a unique 
Karcher mean which lies in the open hemisphere and equals to the Frechet mean. 
Inspired by the methods of Buss and Fillmore, B. Afsari showed in [T] that if 
the upper curvature bound A and the injectivity radius inj in Q is replaced by 
the ones of the larger ball B{a, 2p), then all the Frechet p-means of /x lie inside 
B(a,p). Particularly, the Karcher mean coincides with the Frechet mean. The 
existence and uniqueness of p- means in Finsler geometry are recently proved by 
M. Arnaudon and F. Nielsen in They also showed that Finslerian p-means 
are limiting points of continuous gradient flows and developed algorithms for 
computing p-means in Finsler geometry. 

Medians of discrete sample points on the sphere are studied by economists 
and operational research experts in the 1970s and 1980s, but they used the name 
"location problems on a sphere" . For data points lying in a spherical disc of radius 
smaller than 7r/4, Drezner and Wesolowsky showed in [1^ that the cost function 
is unimodal in that disc and the Frechet median is unique if the data points are 
not contained in a single great circle. It is also shown by Z. Drezner in [^U] that 
if all the sample points are contained in a great circle, then one of the sample 
points will be a Frechet median. Perhaps the first work about Frechet medians 
on Riemannian manifolds is the paper [5^ by R. Noda and his coauthors. They 
proved the uniqueness, characterizations and position estimations of Frechet 
medians for discrete sample points lying in a Cartan-Hadamard manifold. In 



order to do robust statistics for data living in a Riemannian manifold P. T. 
Fletcher and his coauthors defined in [52] the local medians for discrete sample 
points and showed their existence and uniqueness. 

In this paper, we present our results on medians and means of probability 
measures in Riemannian manifolds. Above all, the motivation of our work: radar 
target detection is introduced in section [5] After that, in section |3] we define 
local medians for probability measures in Riemannian manifolds and consider 
the problems of uniqueness and approximation. Under the assumption that the 
support of the probability measure is contained in a convex ball, we give some 
results on the characterization, the position estimation and the uniqueness of 
medians. Then we propose a subgradient algorithm to estimate medians as well 
as giving its convergence result without condition of the sign of curvatures. Our 
algorithm improves the one proposed in |22| which is shown to be convergent only 
if the manifold is nonnegatively curved. Finally, the problem of error estimation 
and rate of convergence are also considered. 

The aim of section |4] is to give some basic properties of Frechet medians of 
probability measures in Riemannian maniolds. Firstly, we give the consistency 
result of Frechet medians in proper metric spaces. Particularly, if a probability 
measure has only one Frechet median, then any sequence of empirical Frechet 
medians will converge almost surely to it. After that, we study the robustness 
of Frechet medians in Riemannian manifolds. It is well known that in Euclidean 
spaces, if a group of data points has more than a half concentrated in a bounded 
region, then its Frechet median cannot be drown arbitrarily far when the other 
points move. A generalization and refinement of this result for data points in 
Riemannian manifolds is given in Theorem [T] This theorem also generalizes 
a result in [I] which states that if the probability measure is supported in a 
strongly convex ball, then all its Frechet medians lie in that ball. At the end 
of this section, the uniqueness question of Frechet sample medians is considered 
in the context of compact Riemannian manifolds. It is shown that, apart from 
several events of probability zero, the Frechet sample medians are unique if the 
sample vector has a density with respect to the canonical Lebesgue measure of 
the product manifold. In other words, the Frechet medians of generic data points 
are always unique. 

Section [5] is devoted to presenting algorithms for computing Frechet p-means 
in order to meet practical needs. Theorem [TUl gives stochastic algorithms which 
converge almost surely to p-means in manifolds, which are easier to implement 
than gradient descent algorithm since computing the gradient of the function 
to minimize is not needed. The idea is at each step to go in the direction of a 
point of the support of /x. The point is chosen at random according to jj, and 
the size of the step is a well chosen function of the distance to the point, p 
and the number of the step. The speed of convergence is given by Theorem [TT| 
which says that the renormalized inhomogeneous Markov chain of Theorem 1101 
converges in law to an inhomogeneous diffusion process. We give the explicit 
expression of this process, as well as its local characteristic. After that, the 
performance of the stochastic algorithms are illustrated by simulations. Finally, 



we show that the p-mean of fi can also be computed by the method of gradient 
descent. The questions concerning the choice of stepsizes and error estimates of 
this deterministic method are also considered. We note that, for the case when 
p = +00, M. Arnaudon and F. Nielsen developed in ^ an efficient algorithm to 
compute the circum-center Coo of probability measures in Riemannian manifolds. 

In section ini we consider the manifold of n x n Toeplitz covariance matrices 
parameterized by the reflection coefficients which are derived from Levinson's 
recursion of autoregressive models. The explicit expression of the reparametriza- 
tion and its inverse are obtained. With the Riemannian metric given by the 
Hessian of a Kahler potential, we show that the manifold is in fact a Cartan- 
Hadamard manifold with lower sectional curvature bound —4. After that, we 
compute the geodesies and use the subgradient algorithm introduced in section 
[3]to find the median of Toeplitz covariance matrices. Finally, we give some simu- 
lated examples to illustrate the application of the median method to radar target 
detection. 

2 Motivation: radar target detection 

Suggested by J. C. Maxwell's seminal work on electromagnetism, H. Hertz car- 
ried out an experiment in 1886 which validated that radio waves could be re- 
flected by metallic objects. This provided C. Hiielsmeyer the theoretical foun- 
dation of his famous patent on "telemobiloscope" in 1904. He showed publicly 
in Germany and Netherlands that his device was able to detect remote metal- 
lic objects such as ships, even in dense fog or darkness, so that collisions could 
be avoided. Hiielsmeyer's "telemobiloscope" is recognized as the primogenitor 
of modern radar even though it could only detect the direction of an object, 
neither its distance nor its speed. This is because the basic idea of radar was 
already born: send radio waves in a predetermined direction and then receive 
the possible echoes reflected by a target. In order to know the distance and the 
radial speed of the target, it suffices to send successively two radio waves. In 
fact, it is easily seen that the distance d of the target can be computed by the 
formula 
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where c is the speed of light and At is the time interval between every emission 
and reception in the direction under test. Moreover, the radial speed v of the 
target can be deduced by the Doppler effect which states that the frequency of 
a wave is changed for an observer moving relatively to the source of the wave. 
More precisely, 

_ XAip 
4^' 

where A and Aip are the wavelength and the skewing of the two emitted radio 
waves, respectively. As a result, the direction, the distance and the speed of the 
target can all be determined. 



For simplicity, from now on we only consider a fixed direction in which a radar 
sends radio waves. Since the range of emitted waves are finite, we can divide this 
direction into some intervals each of which represents a radar cell under test. The 
radar sends each time a rafale of radio waves in this direction and then receive 
the returning echoes. For each echo we measure its amplitude r and phase (p, 
so that it can be represented by a complex number z = re*"^. As a result, the 
observation value of each radar cell is a complex vector Z = (zi, . . . , zjy), where 
TV is the number of waves emitted in each rafale. 

The aim of target detection is to know whether there is a target at the lo- 
cation of some radar cell in this direction. Intuitively speaking, a target is an 
object whose behavior on reflectivity or on speed is very different from its envi- 
ronment. The classical methods for target detection is to compute the difference 
between the discrete Fourier transforms of the radar observation values of the 
cell under test and that of its ambient cells. The bigger this difference is, the 
more likely a target appears at the location of the cell under test. However, 
the performance of these classical methods based on Doppler filtering using dis- 
crete Fourier transforms together with the Constant False Alarm Rate (CFAR) 
is not very satisfactory due to their low resolutions issues in perturbed radar 
environment or with smaller bunch of pulses. 

In order to overcome these drawbacks, a lot of mathematical models for 
spectra estimation were introduced, among which the method based on autore- 
gressive models proposed by F. Barbaresco in [7 is proved to be very preferable. 
We shall introduce this method in Chapter |6] of this dissertation. The main dif- 
ference between this new method and the classical ones is that, instead of using 
directly the radar observation value Z of each cell, we regard it as a realiza- 
tion of a centered stationary Gaussian process and identify it to its covariance 
matrix R — E[ZZ*]. Thus the new observation value for each radar cell is a 
covariance matrix which is also Toeplitz due to the stationarity of the process. 
As a result, the principle for target detection becomes to find the cells where the 
covariance matrix differs greatly from the average matrix of its neighborhood. 
Once such cells are determined we can conclude that there are targets in these 
locations. In order to carry out this new method, there are two important things 
which should be considered seriously. One is to define a good distance between 
two Toeplitz covariance matrices. The other is to give a reasonable definition of 
the average of covariance matrices, which should be robust to outliers so as to 
be adapted to perturbed radar environment, and develop an efficient method to 
compute it in practical cases. These works will be done in the following by study- 
ing the Riemannian geometry of Toeplitz covariance matrices and the medians 
of probability measures in Riemannian manifolds. 

3 Riemannian median and its estimation 

In this section, we define local medians of a probability measure on a Riemannian 
manifold, give their characterization and a natural condition to ensure their 
uniqueness. In order to compute medians in practical cases, we also propose a 



subgradient algorithm and show its convergence. The mathematical details of 
this section can be found in [5T| . 

In more detail, let M be a complete Riemannian manifold with Riemannian 
metric ( • , • ) and Riemannian distance d. We fix an open geodesic ball 

B{a, p) ^ {x e M : d{x, a) < p} 

in M centered at a with a finite radius p. Let 5 and A denote respectively a 
lower and an upper bound of sectional curvatures K in B{a,p). The injectivity 
radius of B{a,p) is denoted by inj {B{a,p)). Furthermore, we assume that the 
radius of the ball verifies 

. . TT inj {B{a,p)) 
^"""""^iVZ' 2 

where if Z\ < 0, then 7r/(4v^) is interpreted as 4-oo. 

We consider a probability measure p on M whose support is contained in 
the open ball B{a,p) and define a function 

/: B{a,p) — ^R+, x\ — > I d{x,p)p{dp). 

J M 

This function is 1-Lipschitz, hence continuous on the compact set B{a,p). 
The convexity of the distance function on B{a, p) yields that / is also convex. 
Hence we don't need to distinguish its local minima from its global ones. Now 
we can give the following definition: 

Definition 1. A minimum point of f is called a median of p. The set of all the 
medians of p will he denoted by 9Jt^ . The minimal value of f will he denoted hy 
/*■ 

It is easily seen that is compact and convex. Moreover, by computing 
the right derivative of / we can prove the following characterization of 971^. 

Theorem 1. The set 9Jt^ is characterized hy 

Mf^^ixe B{a,p) : \H{x)\ < p{x}}, 

where for x G B(a,p), 

Jm\{x} d[x,p) 

is a tangent vector at x satisfying \H{x)\ < 1. 

Observing that every geodesic triangle in B{a,p) has at most one obtuse 
angle, we can prove the following result which gives a position estimation for the 
medians of p. 



Proposition 1. SJt^ is contained in the smallest closed convex subset of B[a,p) 
containing the support of fi. 

In Euclidean case, it is well known that if the sample points are not collinear, 
then their medians are unique. Hence we get a natural condition of fi to ensure 
the uniqueness for medians in Riemannian case: 

* The support of fi is not totally contained in any geodesic. This means 
that for every geodesic 7; [0, 1] — > B{a,p), we have /i(7[0,l]) < 1. 

This condition implies that / is strictly convex along every geodesic in 
B(a,p), so that it has one and only one minimizer, as stated by the theorem 
below. 

Theorem 2. // condition * holds, then /i has a unique median. 

With further analysis, we can show a stronger quantitative version of The- 
orem [21 which is crucial in the error estimations of the subgradient algorithm 
as well as in the convergence proof of the stochastic algorithm for computing 
medians in section [SJ 

Theorem 3. // condition * holds, then there exits a constant t > such that 
for every x € B{a,p) one has 



where m is the unique median of p. 

The main results of approximating medians of fi by subgradient method is 
summarized in the following theorem. The idea stems from the basic observation 
that H{x) is a subgradient of / at a; for every x G B{a, p). 

Theorem 4. Let {tk)k be a sequence of real numbers such that 



Then there exists some constant T > such that if we choose t^ < T for every 
k > 0, then the sequence {xk)k is contained in B{a,p) and verifies 



f{x) > +Td'^{x,m), 



00 



tk > 0, lim tk = and 




Define a sequence {xk)k by xq g B{a,p) and for k > 0, 





lim d{xk,M^) = and lim f{xk) = f*. 



Moreover, if the sequence {tk)k clIso verifies 

oo 

^4 < +00, 

fc=0 

then there exists some m G SH^ such that Xk — > m. 

Remark 1. We can choose the constant T in Theorem 2] to be 

r = P—^ 

C{p,5)F{p,A) + r 
where <j = sup{d(p, a) : p € supp /i}, 



and 



1, if Z\ > 0; 

cosh(2py^), if Z\ < 0, 



1, if(5>0; 
2pV^coth(2pV^), if (5 < 0. 
The proposition below gives the error estimation of the algorithm in Theorem 



in 

Proposition 2. Let condition * hold and the stepsizes {tk)k in Theorem^sat- 
isfy 

00 

lim tk = and N tk = +00. 

k^oo 

k=0 

Then there exists G N, such that for every k > N , 

d^{xk,m) < bk, 

where m is the unique median of p and the sequence {bk)k>N is defined by 

bN = {p + of and bk+i = il-2Ttk)bk + C{p,S)tl, k>N, 
which converges to when fc — )■ 00. More explicitly, for every k > N , 

&fc+i = (p + a)2 J](l-2rt,)+C(p,<5)( t'j-il[(^-2Tt,)+tl). 

i=N j=N+l i=j 

4 Some properties of Prechet medians in Riemannian 
manifolds 

This section is devoted to some basic results about Frechet medians, or equiv- 
alently, global medians. We show the consistency of Frechet medians in proper 
metric spaces, give a quantitative estimation for the robustness of Frechet me- 
dians in Riemannian manifolds and show the almost sure uniqueness of Frechet 
sample medians in compact Riemannian manifolds. We refer to ^2] for more 
details of this section. 



4.1 Consistency of Frechet medians in metric spaces 

In this subsection, we work in a proper metric space (M, d) (recall that a metric 
space is proper if and only if every bounded and closed subset is compact). Let 
Pi{M) denote the set of all the probability measures ^ on M verifying 

/ d{xo,p)fi{dp) < oo, for some xq G M. 
Jm 

For every /i G Pi{M) we can define a function 

: M — S-R+, XI — > d{x,p)fi{dp). 

Jm 

This function is 1-Lipschitz hence continuous on M. Since M is proper, attains 
its minimum (see p. 42]), so we can give the following definition: 

Definition 2. Let ^ he a probability measure in Pi{M), then a global minimum 
point of ff_t is called a Frechet median of fi. The set of all the Frechet medians 
of /i is denoted by . Let f* denote the global minimum of ff^ . 

By the Kantorovich-Rubinstein duality of L^-Wasserstein distance (see [30l 
p. 107]), we can show that Frechet medians are characterized by 1-Lipschitz 
functions. A corresponding result that Riemannian barycenters are characterized 
by convex functions can be found in |25| Lemma 7.2]. 

Proposition 3. Let /i G Pi{M) and M he also separable, then 

= ja; G M : ip{x) < f* + J f{p)n{dp), for every ip G Lipi(Af)|, 

where Lipi{M) denotes the set of all the 1-Lipschitz functions on M . 

The following theorem states that the uniform convergence of first moment 
functions yields the convergence of Frechet medians. 

Theorem 5. Let (/i„)n6N be a sequence in Pi{M) and /i be another probability 
measure in Pi{M). If {f^^)n converges uniformly on M to ffi, then for every 
e > 0, there exists G N, such that for every n > N we have 

Qf,^ C B(Q^, e) := {x e M : d{x, Q^) < e}. 

As a corollary to Theorem [SJ Frechet medians are strongly consistent esti- 
mators. The consistency of Frechet means is proved in |15| . 

Corollary 1. Let (A„)„gN be a sequence of i.i.d random variables of law fi G 
Pi(M) and (TO„)fieN be a sequence of random variables such that to„ G Qp„ 
with fin — X]fe=i ■ V M f^'^s a unique Frechet median m, then nin — > m 
a.s. 



4.2 Robustness of Frechet medians in Riemannian manifolds 



The framework of this subsection is a complete Riemannian manifold (M, d) 
whose dimension is no less than 2. We fix a closed geodesic ball 

B{a, p) ^ {x e M : d{x, a) < p} 

in M centered at a with a finite radius p > and a probability measure fi € 
Pi{M) such that 

p{B{a,p)) ^a> -. 

The aim of this subsection is to estimate the positions of the Frechet medi- 
ans of /i, which gives a quantitative estimation for robustness. To this end, the 
following type of functions are of fundamental importance for our methods. Let 
x,z G M, define 

h^^z ■ B{a,p) — R, p\ — > d{x,p) - d{z,p). 

Obviously, hx^z is continuous and attains its minimum. 

By a simple estimation on the minimum of /i^.a we get the following basic 
result. 

Theorem 6. The set of all the Frechet medians of fi verifies 

Remark 2. It is easily seen that the conclusion of Theorem [S] also holds if M is 
only a proper metric space. 

Remark 3. As a direct corollary to Theorem [6l if is a probability measure in 
Pi{M) such that for some point m G M one has fJ,{m} > 1/2, then m is the 
unique Frechet median of /i. 

In view of Theorem |6l let A be an upper bound of sectional curvatures in 
i3* and inj be the injectivity radius of _B*. By computing the minima of some 
typical functions h^.z in model spaces and H^, and then comparing with 

the ones in M, we get the following main result of this subsection. 



Theorem 7. Assume that 



2ap . r . . . 

< n := mm{^= , mj |, (6) 



2a - 1 ' 
where if A <0, then tt/^/A is interpreted as +oo. 
i) If A> and C B{a,r^/2), then 

i=,f I . ,asm{VAp). 
On C -B[ a, — = arcsm , 



Moreover, any of the two conditions below implies C B{a,r^:/2): 



2ap r* 2ap r, 

2a — 12 2a— 12 2 



where Fan Ait) ^ cot(V A{2a ~ l)t) - cotW At) - 2 cotW Ap) , te (0, — — 

2a — 1 



ii)IfA = 0, then 



Hi) If A <0, then 



V2a- 1 



C -B ( a, ^"^^ arcsinh ^^'^^("^^ ' 



-Z\ ^ V2a- 1 

Finally any of the above three closed balls is contained in the open ball B{a, r^/2) . 

Remark 4- Although we have chosen the framework of this section to be a Rie- 
mannian manifold, the essential tool that has been used is the hinge version of 
the triangle comparison theorem. Consequently, Theorem [7] remains true if M is 
a CAT(Z\) space (see [121 Chapter 2]) and is replaced by 'nj^fA. 

Remark 5. For the case when a = 1, the assumption ^ becomes 



Observe that in this case, when A > 0, the condition Fi.p.zi(r,/2 — p) < is 
trivially true in case of need. Hence Theorem [7] yields that C B{a,p), which 
is exactly what the Theorem 2.1 in [1] says for medians. 



4.3 Uniqueness of Prechet sample medians in compact Riemannian 
manifolds 

Before introducing the results of this subsection we give some notations. For each 
point X G M, Sx denotes the unit sphere in TxM. Moreover, for a tangent vector 
V (z Sx, the distance between x and its cut point along the geodesic starting from 
X with velocity v is denoted by t(v). Certainly, if there is no cut point along this 
geodesic, then we define t{v) = +oo. For every point (xi, . . . ,xn) G , where 
A'^ > 3 is a fixed natural number, we write 

1 ^ 

p{xi,...,xn) = J^'^Sx^- 

k=l 

The set of all the Frechet medians of p.{xi, . . . , xat), is denoted by . . . , xat). 

The following theorem states that in order to get the uniqueness of Frechet 
medians, it suffices to move two data points towards a common median along 
some minimizing geodesies for a little distance. 



Theorem 8. Let {xi, . . . ,xn) G and m e Q{xi, . . . ,xn)- Fix two normal 
geodesies 71,72 : [0,+oo) — > M such that 71 (0) = Xi, ji{d{xi,m)) ~ m, 72(0) = 
X2 and ^2{d{x2,m)) — m. Assume that 

/7i[0,r(7i(0))], z/t(7i(0)) <+oo,- 

Xo S: < 

^\7i[0,+^), z/t(7i(0)) -+00. 
Then for every t e (0, d{xi, m)] and s £ (0, d{x2, m)] we have 

Q{ll{t),l2{s),X3,...,XN) = {m}. 

Generally speaking, the non uniqueness of Frechet medians is due to some 
symmetric properties of data points. As a result, generic data points should have 
a unique Frechet median. In mathematical language, this means that the set of 
all the particular positions of data points is of Lebesgue measure zero. After 
eliminate all these particular cases we obtain the following main result: 

Theorem 9. Assume that M is compact. Then fi(xi, . . . ,xn) has a unique 
Frechet median for almost every (xi, . . . , xjv) G M^. 

Remark 6. In probability language, Theorem[9]is equivalent to say that if (Xi , . . . , 
Xjy) is an M^-valued random variable with density, then fJ,{Xi, . . . , Xjv) has a 
unique Frechet median almost surely. Clearly, the same statement is also true if 
Xi,. . . , Xn are independent and M- valued random variables with density. 

5 Stochastic and deterministic algorithms for computing 
means of probabihty measures 

In this section, we consider a probability measure ^ supported by a regular 
geodesic ball in a manifold and, for any p > 1, define a stochastic algorithm 
which converges almost surely to the p-mean Cj, of /i. Assuming furthermore 
that the functional to minimize is regular around e^, we prove that a natural 
renormalization of the inhomogeneous Markov chain converges in law into an 
inhomogeneous diffusion process. We give the explicit expression of this process, 
as well as its local characteristic. After that, the performance of the stochastic 
algorithms are illustrated by simulations. Finally, we show that the p-mean of 
jj, can also be computed by the method of gradient descent. The questions con- 
cerning the choice of stepsizes and error estimates of this deterministic method 
are also considered. For more mathematical details of this section, see (4j and 

5.1 Stochastic algorithms for computing p-means 

Let M be a Riemannian manifold whose sectional curvatures K{a) verify — < 
K(a) < a^, where a, /3 are positive numbers. Denote by p the Riemannian 
distance on M. Let B{a, r) be a geodesic ball in M and be a probability 
measure with support included in a compact convex subset if^ of B{a,r). Fix 
p S [1,00). We will always make the following assumptions on {r,p,ii): 



Assumption 1 The support of jj, is not reduced to one point. Either p > 1 or 
the support of ^ is not contained in a line, and the radius r satisfies 



r < ra.p with 



a^p = imin{inj(Af),^} , if p e [1,2); 
a^p = i min {inj(M), f } , if p G [2, oo). 



Under Assumption [TJ it has been proved in Theorem 2.1] that the function 



Hp : M 



M 

has a unique minimizer Cp in M, the p-mean of fi, and moreover Cp € B{a, r). If 
p = 1, ei is the median of /z. 

Remark 7. The existence and uniqueness of p-means in Finsler geometry are 
recently proved by M. Arnaudon and F. Nielsen in [6]. They also showed that 
Finslerian p- means are limiting points of continuous gradient flows and developed 
algorithms for computing p-means in Finsler geometry. 

In the following theorem, we define a stochastic gradient algorithm {Xk)k>o 
to approximate the p-mean ep and prove its convergence. In the sequel, let 

p(Kn,B(a,rY) 

K ^ B{a,r -e) with e ^ ^ ^ ' ' ' . 



Theorem 10. Let {Pk)k>i be a sequence of independent B(a,r)-valued random 
variables, with law ji. Let {tk)k>i be a sequence of positive numbers satisfying 

V^>1 t.<™=- ' PiK.^Bia,rY) 



Cp,^^k' 2p(2r)P-i 

CX3 OO 

^fc = +CXD and ifc < oo, 

k=l k=l 

where Cp^p^^K > is a constant. 

Letting xq Cz K , define inductively the random walk {Xk)k>a by 

Xo = xq and for k > Xk+i = exp^^ (-ifc+i grad^^-^ Fp{-,Pk+i)) 

where Fp{x, y) — p^{x, y), with the convention grad^ Pp{'^ ^) = 0. 

The random walk {Xk)k>i converges in L^ and almost surely to ep. 

Remark 8. For the case when p = +oo, M. Arnaudon and F. Nielsen developed 
in [5] an efficient algorithm to compute the circum-center Coo of probability 
measures in Riemannian manifolds. 

In the following example, we focus on the case AI = and p = 2 where 
drastic simplifications occur. 



Example 1. In the case when M — M."^ and /i is a compactly supported proba- 
bihty measure on M.'^, the stochastic gradient algorithm ([7]) simplifies into 

Xo = xo and for fc > Xk+i ^ Xk - tk+i grsidx^ Fp{-, Pk+i). 

If furthermore p = 2, clearly 62 — ^[Pi] and grad^ Fp{-,y) = 2{x — y), so that 
the linear relation 

Xk+i = (1 - 2tfe+i)Xfc + 2ife+iPfc+i, fc > 

holds true and an easy induction proves that 

k-l j~l 

Xk=xoY[il-2tk-j)+2j2Pk~jtk-jY[{l-2tk-i), k>l. (7) 

j=o j=o e=Q 

Now, taking tk — —, we have 
2fc 

fc - J 



Y[{1 ^ 2tk-j) = and Y[{l-2tk-e) 



so that 



j=0 j=l 

The stochastic gradient algorithm estimating the mean 62 of fi is given by the 
empirical mean of a growing sample of independent random variables with dis- 
tribution ji. In this simple case, the result of Theorem [TU] is nothing but the 
strong law of large numbers. Moreover, fluctuations around the mean are given 
by the central limit theorem and Donsker's theorem. 

The fluctuation of the random walk {Xk)k defined in Theorem [10] is summa- 
rized in the following theorem. 

Theorem 11. Assume that in Theorem [Tdi 

S f 1 piK^,B{a,rr) 



tk = min — , mill — , — ^ z , fc > 1, 

\k \Cp,^,k' 2p(2r)f-i J J' 

for some 6 > 0. We define for n > 1 the Markov chain {Yi^)k>o in T^^M by 

n" = ;|exp-i^fc. 

Assume that Hp is in a neighborhood of Cp and S > ^. Then the se- 
quence of processes (^^[J^t]^ converges weakly in D((0, 00), Te^M) to a diffusion 
process given by 



ys 



,-1 "'0 



ei)ei, i > 0, 



where Bt is the standard Brownian motion in Te^M and a G 'EiTiA[Te^M) satis- 
fying 

GO* = E [grad,^ Fp[-,Px) ® grad,^ -Fp(-, ^i)] , 

{&i)\<i<d is an orthonormal basis diagonalizing the symmetric bilinear form 
WdHp{ep) and {\i)i<i<d are the associated eigenvalues. 



5.2 Simulations of stochastic algorithms 

A non uniform measure on the unit square in the plane Here M is the 
Euclidean plane and fx is the renormalized restriction to the square [0,4] x 
[0,4] of an exponential law on [0,oo) x [0,oo). The red path represents one 
trajectory of the inhomogeneous Markov chain {Xk)k>o corresponding to p = 1, 
with linear interpolation between the different steps. The red point is ei. Black 
circles represent the values of (Pfc)fe>i- 




Fig. 1. Median of a non uniform measure on the unit square in the plane 



Medians in the Poincare disc In the two figures below, M is the Poincare 
disc, the blue points are data points and the red path represents one trajec- 
tory of the inhomogeneous Markov chain {Xk)k>o corresponding to p = 1, with 
linear interpolation between the different steps. The green points are medians 
computed by the subgradient method developed in section |3l 




Fig. 3. Median of points in tlie Poicare disc 



5.3 Computing p- means by gradient descent 

Gradient descent algorithms for computing Cp are given in the following theorem. 
In view of Theorem 21 it suffices to consider the case when p > 1. 

Theorem 12. Assume that p > I. Let xq € B(a, r) and for k > define 

Xk+i = exp^^ {-tk grad^^ Hp), 
where {tk)k is a sequence of real numbers such that 

^<tk < — 2^o — t-ufoa \ I n' ,1™ ^fc = and V ife = +00. 

7rp"^(2r)^P p coth[2 pr )+ peP fe^oo ^— ^ 

Then the sequence {xk)k is contained in B{a,p) and converges to Cp. 

The following proposition gives the error estimations of the gradient descent 
algorithms in Theorem 1121 

Proposition 4. Assume that tk < for every k in Theorem \12l then the 

following error estimations hold: 

i) if 1 < p < 2, then for k > I, 

k-i 

p'^ixk,ep) <4r2 - Cp^t,,KU) 

i=0 

,k-\ fc-1 ^ 

+ C(/3, r,p)\y^ t]_^ n (1 - '^v,^^,Kt^) + iti j &fc; 

ii) if p > 2, then for k > 1, 

fc-i 

Hp{xk) - Hp{ep) < (2r)P - Cp^p^Kt^) 

i=Q 

,k-l fe-1 s 

+ r,p)iY, t]_^ - Cp^p^Kt^) + 4-1 ] := Cfc, 



where the constant 
C{l3,r,p) = 

Moreover, the sequences {bk)k and {ck)k both tend to zero 



p2(2r)2p-i/3coth(2/3r), if l<p<2; 

p3{2rfP-^ (2/3r coth(2^r) +p-2), if p>2. 



6 Riemannian geometry of Toeplitz covariance matrices 
and applications to radar target detection 

In this section we study the Riemannian geometry of the manifold of ToepHtz 
covariance matrices of order n. The exphcit expression of the reflection coef- 
ficients reparametrization and its inverse are obtained. With the Riemannian 
metric given by the Hessian of a Kahler potential, we show that the manifold 
is in fact a Cartan-Hadamard manifold with lower sectional curvature bound 
—4. The geodesies in this manifold are also computed. Finally, we apply the 
subgradient algorithm introduced in section [3] and the Riemannian geometry of 
Toeplitz covariance matrices to radar target detection. We refer to [53] for more 
mathematical details of this section. 



6.1 Reflection coefficients parametrization 

Let Tn be the set of Toeplitz Hermitian positive definite matrices of order n. It 
is an open submanifold of R-^"^^. Each element i?„ e Tn can be written as 



R„ 



Tn-l 



ri ro 



For every 1 < fc < ti — 1, the upper left (fc + l)-by-(fc + 1) corner of i?„ is denoted 
by Rk ■ It is associated to a fc-th order autoregressive model whose Yule- Walker 
equation is 





ri 


■ ■■ rk 




- 1 - 




'Pk 


ri 


ro 


■ ■ . rk-i 











rk 




ri ro 








_ 0_ 



where a^'^'' 



,(fc) 



det i?fc+i/det Rk 



. , ' are the optimal prediction coefficients and Pk 
is the mean squared error. 

(k) 

The last optimal prediction coefficient a), ' is called the fc-th reflection co- 
efficient and is denoted by /Xfc. It is easily seen that /zi, . . . , are uniquely 
determined by the matrix Moreover, the classical Levinson's recursion gives 
that |/Zfc| < 1. Hence, by letting Pq = tq, we obtain a map between two subman- 
ifolds of R2"-1: 



r„ ^ r; X D"- 



R„ 



(-Po,Mi 



, Mra-l); 



where D = {zGC:|z|<l}is the unit disc of the complex plane. 

Using the Cramer's rule and the method of Schur complement we get the 
following proposition. 



Proposition 5. ip is a diffeomorphism, whose explicit expression is 



f^k = (-1) 



k det Sk 
det Rk 



whe 



Sk = i?, 



k+l 



2,...,fc + 1 
l,...,k 



is the suhmatrix of Rk+i obtained by deleting the first row and the last column. 
On the other hand, if {Pq, /ii, • . . , /^n-i) G x D"^^, then its inverse image 
Rn under ip can be calculated by the following algorithm: 

ro = Po, ri = -PoMi, 

rk = -HkPk-i + al_iJk-iRkliak-i, 2 < fc < n - 1, 

where 



k-l 





ri 




"0 . 


. 1 






, Jk-l — 


. 


. 1 


ak-i = 










1 . 


. 



and Pk-l=Po'[[il~\n^\^). 



6.2 Riemannian geometry of Toeplitz covariance matrices 

From now on, we regard 7^ as a Riemannian manifold whose metric, which is 
introduced in [84 by the Hessian of the Kahler potential 



<?(i?„) = - ln(det Rn) - n ln(7re). 



is given by 



ds' 



dPl 
Pi 



fe=i 



\dnk\' 



(8) 



where (Pq, Mi, • • ■ , = f{Rn)- 

The metric ([5]) is a Bergman type metric and it has be shown in |33j that 
this metric is not equal to the Fisher information metric of 7^. But J. Burbea 
and C. R. Rao have proved in [IS, Theorem 2] that the Bergman metric and 
the Fisher information metric do coincide for some probability density functions 
of particular forms. A similar potential function was used by S. Amari in [2] to 
derive the Riemannian metric of multi-variate Gaussian distributions by means 
of divergence functions. We refer to [29, for more account on the geometry of 
Hessian structures. 

With the metric given by (|8|) the space x D"~^ is just the product of the 
Riemannian manifolds (R;^,(iso) and (D, (is^)i<fe<„_i, where 



dP"^ 

dsQ = n—j- and ds^ = {n — k) 
"0 



The latter is just n — k times the classical Poincare metric of D. Hence {TV^ x 
D"~^, ds'^) is a Cartan-Hadamard manifold whose sectional curvatures K verify 



— 4 < ii' < 0. The Riemannian distance between two different points x and y in 
R;^ X D"~^ is given by 



d{x,y) = {na{P,Qf + ^{n - k)T{iik,Vkf\ 

^ fe=i ^ 



1/2 



where a; = (P,/xi, . . . ,/i„_i), y = (Q, i/i, . . . , f„_i), 

Q -I 1 _|_ I ^k-jlk 

a(P,Q) = I ln(^)| and T(Mfc, i^fc) = o In 



The geodesic from a; to y in 7^ parameterized by arc length is given by 

where 70 is the geodesic in (R!]_, rfsg) from P to Q parameterized by arc length 
and for 1 < A: < n — 1, 7/^. is the geodesic in (D, ds\) from /x^ to parameterized 
by arc length. More precisely, 

70 (i) = ^"6*^'^"^-^), 

and for 1 < fc < n — 1, 

(Mfe + e'^'=)e2* + (/xfe-e'^'=) - Mfc 

= 7;-^— TeTTTa^n^^i TTTflTT' ^here 0ft = arg ■ 



(l + ;[ifee'^^)e2* + (l-/xfce'^^)' ^l-^Ufeffe" 
Particularly, 

7(0,.,y) = (7o(0)^^,7i(0)^^,...,7„-i(0)^^^^). 

Let V = {vo,vi,. . . ,Vn-i) be a tangent vector in TxCR.'^ x D"~^), then the 
geodesic starting from x with velocity v is given by 

at,X, V) = iCoit), Clit), ■■■ ,(n-l{t)), 

where (0 is the geodesic in (R^,cisg) starting from P with velocity vq and for 
1 < fc < n — 1, Cfe is the geodesic in (D, dsD starting from Hk with velocity Vk- 
More precisely, 

Co{t) = Pe^*, 

and for 1 < A: < n — 1, 

r {^^k + e'^'')e^^ + {nk - e'^") . „ 

Ck[t) = 2|„fe|t ^ ' where 9k = argi;^. 

(1 + /Ufee'^'«)ei-i''fci^ + (1 - Mfee'^'=) 



6.3 Radar simulations 



Now we give some simulating examples of the median method applied to radar 
target detection. 

Since the autoregressive spectra are closely related to the speed of targets, 
we shall first investigate the spectral performance of the median method. In 
order to illustrate the basic idea, we only consider the detection of one fixed 
direction. The range along this direction is subdivided into 200 lattices in which 
we add two targets, the echo of each lattice is modeled by an autoregressive 
process. The following Figure 0] gives the initial spectra of the simulation, where 
X axis represents the lattices and y axis represents frequencies. Every lattice is 
identified with a 1 x 8 vector of refiection coefficients which is calculated by using 
the regularized Burg algorithm [11] to the original simulating data. The spectra 
are represented by different colors whose corresponding values are indicated in 
the colorimetric on the right. 



Target 2 




/ 20 40 60 80 100 120 140 160 180 200 



Fig. 4. Initial spectra with two added targets 



For every lattice, by using the subgradient algorithm, we calculate the median 
of the window centered on it and consisting of 15 lattices and then we get the 
spectra of medians shown in Figure[SJ Furthermore, by comparing it with Figure 
[HI which are spectra of barycenters, we see that in the middle of the barycenter 
spectra, this is just the place where the second target appears, there is an obvious 
distortion. This explains that median is much more robust than barycenter when 
outliers come. 




20 40 60 80 100 120 140 160 180 
lattice 



Fig. 5. Median spectra 
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Fig. 6. Barycenter spectra 



The principle of target detection is that a target appears in a lattice if the 
distance between this lattice and the median of the window around it is much 
bigger than that of the ambient lattices. The following Figure [7] shows that 
the two added targets are well detected by the median method, where x axis 
represents lattice and y axis represents the distance in Tg between each lattice 
and the median of the window around it. 




lattice 



Fig. 7. Detection by median 



We conclude our discussion by showing the performance of the median method 
in real target detection. As above, we give the images of autoregressive spectra 
and the figure of target detection obtained by using real data which are records 
of a radar located on a coast. These records consist of about 5000 lattices of 
a range of about 10km-140km as well as 109 azimuth values corresponding to 
approximately 30 scanning degrees of the radar. For simplicity we consider the 
data of all the lattices but in a fixed direction, hence each lattice corresponds 
to a 1 X 8 vector of reflection coefficients computed by applying the regularized 
Burg algorithm to the original real data. Figure [5] gives the initial autoregressive 
spectra whose values are represented by different color according to the colori- 
metric on the right. For each lattice, by using the subgradient algorithm, we 
calculate the median of the window centered on it and consisting of 17 lattices 
and then we get the spectra of medians shown in Figure [9l 

In order to know in which lattice target appears, we compare the distance 
between each lattice and the median of the window around it. The following 
Figure [TU] shows that the four targets are well detected by our method, where x 
axis represents distance and y axis represents the distance in Ts between each 
lattice and the median of the window around it. 
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Fig. 8. Initial spectra of real radar data 
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Fig. 9. Median spectra of real radar data 
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Fig. 10. Real detection by median 



References 

1. B. Afsari, Riemannian U" center of mass : existence, uniqueness, and convexity, 
Proceedings of the American Mathematical Society, S 0002-9939(2010)10541-5, 
Article electronically published on August 27, 2010. 

2. S. Amari and A. Cichocki, Information geometry of divergence functions, Bulletin 
of the Polish Academy of Sciences, Technical Sciences, Vol. 58, No. 1, 2010. 

3. M. Arnaudon and X. M. Li, Barycenters of measures transported by stochastic 
flows, The Annals of probability, 33 (2005), no. 4, 1509-1543. 

4. M. Arnaudon, C. Dombry, A. Phan and L. Yang, Stochastic algorithms for com- 
puting means of probability measures, preprint hal-00540623, version 2, (2011). to 
appear in Stochastic Processes and their Applications. 

5. M. Arnaudon and F. Nielsen, On approximating the Riemannian 1-center, hal- 
00560 187- version 1 (2011). 

6. M. Arnaudon and F. Nielsen, Medians and means in Finsler geometry, hal- 
00540625-version 2 (2011), to appear in LMS J. Comput. Math. 

7. F. Barbaresco, Innovative Tools for Radar Signal Processing Based on Cartan's 
Geometry of SPD Matrices and Information Geometry, IEEE International Radar 
Conference (2008). 

8. F. Barbaresco, Interactions between Symmetric Cone and Information Geometries, 
ETVC'08, Springer Lecture Notes in Computer Science 5416 (2009), pp. 124-163. 

9. F. Barbaresco and G. Bouyt, Espace Riemannien symetrique et geometric des es- 
paces de matrices de covariance : equations de diffusion et calculs de medianes, 
GRETSr09 conference, Dijon, September 2009 

10. F. Barbaresco, New Foundation of Radar Doppler Signal Processing based on Ad- 
vanced Differential Geometry of Symmetric Spaces: Doppler Matrix GEAR and 
Radar Application, Radar'09 Conference, Bordeaux, October 2009 



11. F. Barbaresco, Annalyse Doppler: regularisation d'un probleme inverse mal pose, 
Support do cours 

12. F. Barbaresco, Science geometrique de llnformation : Geometrie des matrices de 
covariance, espace metrique de Frechet et domaines homes homogenes de Siegel, 
Conference GRETSI'll, Bordeaux, Sept. 2011 

13. F. Barbaresco, Robust Statistical Radar Processing in Frechet Metric Space: OS- 
HDR-CFAR and OS-STAP Processing in Siegel Homogeneous Bounded Domains, 
Proceedings of IRS'll, International Radar Conference, Leipzig, Sept. 2011 

14. F. Barbaresco, Geometric Radar Processing based on Frechet Distance : Informa- 
tion Geometry versus Optimal Transport Theory, Proceedings of IRS'll, Interna- 
tional Radar Conference, Leipzig, Sept. 2011 

15. R. Bhattacharya and V. Patrangenaru, Large sample theory of intrinsic and ex- 
trinsic sample means on manifolds. I, The Annals of Statistics, 2003, Vol 31, No. 
1, 1-29 

16. M. Bridson and A. Haefliger, Metric spaces of non-positive curvature, (Springer, 
Berlin, 1999). 

17. S. R. Buss and J. P. Fillmore, Spherical averages and applications to spherical 
splines and interpolation, ACM Transactions on Graphics vol. 20(2001), pp. 95- 
126. 

18. J. Burbea and C. R. Rao, Diffemtial metrics in probability spaces, Probability and 
Mathematical Statistics, Vol. 3, Fasc. 2, pp. 241-258, 1984. 

19. Z. Drczncr and G. O. Wesolowsky, Facility location on a sphere, J. Opl Res Soc. 
Vol. 29, 10, pp. 997-1004. 

20. Z. Drczncr, On location dominance on spherical surfaces. Operation Research, Vol. 
29, No. 6, November-December 1981, pp. 1218-1219. 

21. M. Emery and G. Mokobodzki, Sur le barycentre d'une probabilite dans une variete, 
Seminaire de Probabilites-XXV, Lecture Notes in Mathematics 1485. (Springer, 
Berlin, 1991), pp. 220-233. 

22. P. T. Fletcher ct al.. The geometric median on Riemannian manifolds with appli- 
cation to robust atlas estimation, Ncurolmagc, 45 (2009), S143-S152. 

23. M. Frechet, Les elements aleatoires de natures quelconque dans un espace distancie, 
Annales de I'l.H.R, tome 10, n°4 (1948), p. 215-310. 

24. H. Karcher, Riemannian center of mass and moUifier smoothing. Communications 
on Pure and Applied Mathematics, vol xxx (1977), 509-541. 

25. W. S. Kendall, Probability, convexity, and harmonic maps with small image I: 
uniqueness and fine existence, Proc. London Math. Soc, (3) 61 (1990), no. 2, 371- 
406. 

26. R. Noda, T. Sakai and M. Morimoto, Generalized Fermat's problem, Canad. Math. 
Bull. Vol. 34(1), 1991, pp. 96-104. 

27. J. Picard, Barycentres et martingales sur une variete, Ann. Inst. H. Poincare 
Probab. Statist, 30 (1994), no. 4, 647-702. 

28. A. Sahib, Esperance d'une variable aleatoire a valeur dans un espace metnque. 
These de I'Universite de Rouen (1998). 

29. H. Shima, The geometry of hessian structures. World Scientific Publishing (2007). 

30. C. Villani, Optimal Transport: Old and New. Springer- Verlag, 2009. 

31. L. Yang, Riemannian median and its estimation, LMS J. Gomput. Math, vol 13 

(2010) , pp. 461-479. 

32. L. Yang, Some properties of Frechet medians in Riemannian manifolds, preprint 

(2011) , submitted. 

33. L. Yang, Medianes de mesures de probabilite dans les varietes riemanniennes et 
applications a la detection de cibles radar, These de I'Universite de Poitiers, 2011. 



Preface 



This textbook is intended for use by students of physics, physical chemistry, 
and theoretical chemistry. The reader is presumed to have a basic knowledge 
of atomic and quantum physics at the level provided, for example, by the first 
few chapters in our book The Physics of Atoms and Quanta. The student of 
physics will find here material which should be included in the basic education 
of every physicist. This book should furthermore allow students to acquire an 
appreciation of the breadth and variety within the field of molecular physics and 
its future as a fascinating area of research. 

For the student of chemistry, the concepts introduced in this book will provide 
a theoretical framework for that entire field of study. With the help of these con- 
cepts, it is at least in principle possible to reduce the enormous body of empirical 
chemical knowledge to a few basic principles: those of quantum mechanics. In 
addition, modern physical methods whose fundamentals are introduced here are 
becoming increasingly important in chemistry and now represent indispensable 
tools for the chemist. As examples, we might mention the structural analysis of 
complex organic compounds, spectroscopic investigation of very rapid reaction 
processes or, as a practical application, the remote detection of pollutants in the 
air. 
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1 Fixed-Period Problems: The Sublinecir Case 

With this chapter, the prehminaries are over, and we begin the search for periodic 

solutions to Hamiltonian systems. All this will be done in the convex case; that 
is, we shall study the boundary- value problem 

X = JH'{t,x) 
x{0) = x(T) 

with H{t, ■) a convex function of x, going to +00 when ||a;|| — > cxd. 
1.1 Autonomous Systems 

In this section, we will consider the case when the Hamiltonian H{x) is au- 
tonomous. For the sake of simplicity, we shall also assume that it is C^. 

We shall first consider the question of nontriviality, within the general frame- 
work of {Aoo, -Bco)-subquadratic Hamiltonians. In the second subsection, we shall 
look into the special case when H is (0, 6oo)-subquadratic, and we shall try to 
derive additional information. 

The General Case: Nontriviality. We assume that H is (Aqo, -Boo) -sub- 
quadratic at infinity, for some constant symmetric matrices A^o and Boo, with 
Boo — Aoo positive definite. Set: 



7 : = smallest eigenvalue of Boo — A, 



00 



(1) 



A : = largest negative eigenvalue of J— + A, 



dt 



■00 



(2) 



Theorem [T] tells us that if A + 7 < 0, the boundary- value problem: 



X = JH'{x) 
x{0) = x{T) 



(3) 



has at least one solution x, which is found by minimizing the dual action func- 
tional: 



{A-^u,u)+N*{-u) 



dt 



(4) 



on the range of A, which is a subspace R{A)j^ with finite codimension. Here 



N{x) := H{x) - - iAooX,x) 



is a convex function, and 



1 



N{x) < -{{Boo - Aoa)x,x) +c Vx 
Proposition 1. Assume H'{0) = and H{0) = 0. 5*6^; 

S := liminf 27V(2;) \\x\r^ . 

If "f < —A < S, the solution u is non-zero: 

x{t) / \/t . 



(5) 



(6) 



(7) 



(8) 



Proof. Condition ([7]) means that, for every 5' > S, there is some e > such that 

(9) 



M<e^N{x)<^\\xf 



It is an exercise in convex analysis, into which we shall not go, to show that 
this implies that there is an 77 > such that 



fM<V->N*{y)<^\\yf 



(10) 



Fig. 1. This is the caption of the figure displaying a white eagle and a white horse on 
a snow field 
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Since ui is a smooth function, we will have < t] for h small enough, 

and inequality (jlOp will hold, yielding thereby: 



i^ihui) < |[ui||2 + ^ ||ui||^ 



(11) 



If we choose S' close enough to S, the quantity (i + jr) will be negative, and 
we end up with 

■)p{hui) < ioT h^O smah . (12) 

On the other hand, we check directly that V'(O) = 0. This shows that cannot 
be a minimizer of ip, not even a local one. So It 7^ and u ^ 71^^(0) = 0. □ 



^ and {aoo,boo)-suhquadratic at infinity. Let ^1, 
, ^AT he the equilibria, that is, the solutions of H'{^) = 0. Denote by uok the 



Corollary 1. Assume H is C and (aoo,6oo 



smallest eigenvalue of H" {(,k), and set: 

u! := Min {wi, . . . , ujk} 

If-- 



T 

— boo < -E 



On 

27r ° 



T 
27r 



(13) 
(14) 



then minimization of ip yields a non-constant T-periodic solution x. 



We recall once more that by the integer part E[a] of a £ H, we mean the 
a G ^ such that a < a < a + 1. For instance, if we take 0^0 = 0, Corollary 2 
tells us that x exists and is non-constant provided that: 



T , T 
— boo < 1 < — 
27r 27r 



or 



27r 27r 



(15) 
(16) 



Proof. The spectrum of yl is ^2Z 
given by + Ooo, where 



27r,, 

floo < < — (feo 



Ooo- The largest negative eigenvalue A is 
l) + aoo. (17) 



Hence: 



ko = E 



T 



The condition 7 < —A < S now becomes: 

27r 



< 



which is precisely condition ([14 



(18) 

(19) 

□ 
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Lemma 1. Assume that H IS on ]R^"\{0} and that H"{x) is non-degenerate 
for any x ^ 0. Then any local minimizer x of ijj has minimal period T . 

Proof. We know that x, or x + ^ for sorae constant ^ e H^", is a T-periodic 
solution of the Hamihonian system: 

X = JH'{x) . (20) 

There is no loss of generality in taking ^ 0. So il^i^) ^ ^(2?) for all x in 
some neighbom-hood oi x in W^^^ (IR/T^; R^") . 

But this index is precisely the index irix) of the T-periodic solution x over 
the interval (0,T), as defined in Sect. 2.6. So 

irix) = . (21) 

Now if X has a lower period, T/k say, we would have, by Corollary 31: 

irix) = ikT/k{x) > kir/kix) + k-l>k-l>l . (22) 

This would contradict ([2l]) . and thus cannot happen. □ 

Notes and Comments. The results in this section are a refined version of [1]; the 
minimality result of Proposition 14 was the first of its kind. 

To understand the nontriviality conditions, such as the one in formula (|16p . 
one may think of a one-parameter family xt, T e (2ttuj~'^ ,2iTb^) of periodic 
solutions, xriO) = xt{T), with xt going away to infinity when T — s> 27rw^^, 
which is the period of the linearized system at 0. 



Table 1. This is the example table taken out of The Tj^book, p. 246 



Year 


World population 


8000 B.C. 


5,000,000 


50 A.D. 


200,000,000 


1650 A.D. 


500,000,000 


1945 A.D. 


2,300,000,000 


1980 A.D. 


4,400,000,000 



Theorem 1 (Ghoussoub-Preiss). Assume H{t,x) is {0, e)-subquadratic at 
infinity for all e > 0, and T-periodic in t 

H{t, •) is convex Vi (23) 

H{-,x) is T-periodic Vx (24) 
H{t,x) > n{\\x\\) with n(s)s^"^ — > cxo as s — > cxo (25) 
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Ve>0, 3c : H{t,x) < ^\\xf + c . (26) 

Assume also that H is , and H"(t,x) is positive definite everywhere. Then 
there is a sequence Xk, A; € IN, of kT -periodic solutions of the system 

x = JH'{t,x) (27) 

such that, for every A; G E^, there is some Po G IN with: 

p > Po ^ Xpk Xk . (28) 

□ 

Example 1 (External forcing^. Consider the system: 

X = JH'{x) + f{t) (29) 

where the Hamiltonian H is (0, 6(x))-subquadratic, and the forcing term is a 
distribution on the circle: 

f=-^F + fo with FgL^ {m./T2Z- IR^") , (30) 
where := f{t)dt. For instance, 

fit) =^Ski, (31) 
feeiN 

where Sk is the Dirac mass at t = fc and ^ G K^" is a constant, fits the pre- 
scription. This means that the system x = JH'(x) is being excited by a series 
of identical shocks at interval T. 

Definition 1. Let Aoo{t) and Boo{t) he symmetric operators m H^", depending 
continuously on t ^ [0,7"], such that Aoo{t) < Boo(^) for all t. 

A Borelian function H : [0,T] x H^" IR is called {A^,B^)-subquadratic 



at infinity if there exists a function N{t, x) such that: 

Hit, a;) = i {A^{t)x, x) + N{t, x) (32) 

Vt , N{t, x) is convex with respect to x (33) 

N{t,x) > n{\\x\\) with n(s)s~-^ -)■ +oo as s +oo (34) 

3c G IR : H{t, x)<^ iB^{t)x, x) + c \/x . (35) 



If Aoc{t) = aoci and B^{t) = b^ol, with < 6oo € IR, we shall say that 

H is (aQc,boo)-subquadratic at infinity. As an example, the function with 
1 < a < 2, is {0, e)-subquadratic at infinity for every e > 0. Similarly, the 
Hamiltonian 

H{t,x) = lk\\kf + \\xr (36) 
is {k, k + e)-subquadratic for every £ > 0. Note that, if k <0, it is not convex. 
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Notes and Comments. The first results on subharmonics were obtained by Ra- 
binowitz in [5] , who showed the existence of infinitely many subharmonics both 
in the subquadratic and superquadratic case, with suitable growth conditions 
on H' . Again the duality approach enabled Clarke and Ekeland in 2 to treat 
the same problem in the convex-subquadratic case, with growth conditions on 
H only. 

Recently, Michalek and Tarantello (see [3] and [4]) have obtained lower bound 
on the number of subharmonics of period kT, based on symmetry considerations 
and on pinching estimates, as in Sect. 5.2 of this article. 
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1 Fixed-Period Problems: The Sublinear Case 

With this chapter, the preliminaries are over, and we begin the search for periodic 
solutions to Hamiltonian systems. All this will be done in the convex case; that 
is, we shall study the boundary-value problem 

X = JH'{t,x) 
x{f)) = x{T) 

with H{t, •) a convex function of x, going to +oo when oo. 
1.1 Autonomous Systems 

In this section, we will consider the case when the Hamiltonian H{x) is au- 
tonomous. For the sake of simplicity, we shall also assume that it is . 

We shall first consider the question of nontriviality, within the general frame- 
work of (^oo, -Bco)-subquadratic Hamiltonians. In the second subsection, we shall 
look into the special case when H is (0, 6oo)-subquadratic, and we shall try to 
derive additional information. 

The General Case: Nontriviality. We assume that H is {Aao^B^)-su}o- 
quadratic at infinity, for some constant symmetric matrices and Boo, with 
Boo — Aoo positive definite. Set: 

7 : = smallest eigenvalue of B^o — ^oo (1) 
A : = largest negative eigenvalue of -|- A^o ■ (2) 



Theorem 21 tells us that if A + 7 < 0, the boundary- value problem: 



X = JH'{x) 
x{0) = x{T) 



(3) 



has at least one solution x, which is found by minimizing the dual action func- 
tional: 



{A-^u,u)+N*{-u) 



dt 



(4) 



on the range of A, which is a subspace R{A)j^ with finite codimension. Here 



N{x) := H{x) - - iAooX,x) 



is a convex function, and 



1 



N{x) < -{{Boo - Aoa)x,x) +c Vx 
Proposition 1. Assume H'{0) = and H{0) = 0. 5*6^; 

S := liminf 27V(2;) \\x\r^ . 

If "f < —A < S, the solution u is non-zero: 

x{t) / \/t . 



(5) 



(6) 



(7) 



(8) 



Proof. Condition ([7]) means that, for every 5' > S, there is some e > such that 

(9) 



M<e^N{x)<^\\xf 



It is an exercise in convex analysis, into which we shall not go, to show that 
this implies that there is an 77 > such that 



fM<V->N*{y)<^\\yf 



(10) 



Fig. 1. This is the caption of the figure displaying a white eagle and a white horse on 
a snow field 
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Since ui is a smooth function, we will have < t] for h small enough, 

and inequality (jlOp will hold, yielding thereby: 



i^ihui) < |[ui||2 + ^ ||ui||^ 



(11) 



If we choose S' close enough to S, the quantity (i + jr) will be negative, and 
we end up with 

■)p{hui) < ioT h^O smah . (12) 

On the other hand, we check directly that V'(O) = 0. This shows that cannot 
be a minimizer of ip, not even a local one. So It 7^ and u ^ 71^^(0) = 0. □ 



^ and {aoo,boo)-suhquadratic at infinity. Let ^1, 
, ^AT he the equilibria, that is, the solutions of H'{^) = 0. Denote by uok the 



Corollary 1. Assume H is C and (aoo,6oo 



smallest eigenvalue of H" {(,k), and set: 

u! := Min {wi, . . . , ujk} 

If-- 



T 

— boo < -E 



On 

27r ° 



T 
27r 



(13) 
(14) 



then minimization of ip yields a non-constant T-periodic solution x. 



We recall once more that by the integer part E[a] of a £ H, we mean the 
a G ^ such that a < a < a + 1. For instance, if we take 0^0 = 0, Corollary 2 
tells us that x exists and is non-constant provided that: 



T , T 
— boo < 1 < — 
27r 27r 



or 



27r 27r 



(15) 
(16) 



Proof. The spectrum of yl is ^2Z 
given by + Ooo, where 



27r,, 

floo < < — (feo 



Ooo- The largest negative eigenvalue A is 
l) + aoo. (17) 



Hence: 



ko = E 



T 



The condition 7 < —A < S now becomes: 

27r 



< 



which is precisely condition ([14 



(18) 

(19) 

□ 
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Lemma 1. Assume that H IS on ]R^"\{0} and that H"{x) is non-degenerate 
for any x ^ 0. Then any local minimizer x of ijj has minimal period T . 

Proof. We know that x, or x + ^ for sorae constant ^ e H^", is a T-periodic 
solution of the Hamihonian system: 

X = JH'{x) . (20) 

There is no loss of generality in taking ^ 0. So il^i^) ^ ^(2?) for all x in 
some neighbom-hood oi x in W^^^ (IR/T^; R^") . 

But this index is precisely the index irix) of the T-periodic solution x over 
the interval (0,T), as defined in Sect. 2.6. So 

irix) = . (21) 

Now if X has a lower period, T/k say, we would have, by Corollary 31: 

irix) = ikT/k{x) > kir/kix) + k-l>k-l>l . (22) 

This would contradict ([2l]) . and thus cannot happen. □ 

Notes and Comments. The results in this section are a refined version of 119801 
the minimality result of Proposition 14 was the first of its kind. 

To understand the nontriviality conditions, such as the one in formula (|16p . 
one may think of a one-parameter family xt, T e (2'kuj~'^ ,2iTb^) of periodic 
solutions, xriO) = xt{T), with xt going away to infinity when T — s> 27rw^^, 
which is the period of the linearized system at 0. 



Table 1. This is the example table taken out of The Tj^book, p. 246 



Year 


World population 


8000 B.C. 


5,000,000 


50 A.D. 


200,000,000 


1650 A.D. 


500,000,000 


1945 A.D. 


2,300,000,000 


1980 A.D. 


4,400,000,000 



Theorem 1 (Ghoussoub-Preiss). Assume H{t,x) is {0, e)-subquadratic at 
infinity for all e > 0, and T-periodic in t 

H{t, •) is convex Vi (23) 

H{-,x) is T-periodic Vx (24) 
H{t,x) > n{\\x\\) with n(s)s^"^ — > cxo as s — > cxo (25) 
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Ve>0, 3c : H{t,x) < ^\\xf + c . (26) 

Assume also that H is , and H"(t,x) is positive definite everywhere. Then 
there is a sequence Xk, A; € IN, of kT -periodic solutions of the system 

x = JH'{t,x) (27) 

such that, for every A; G E^, there is some Po G IN with: 

p > Po ^ Xpk Xk . (28) 

□ 

Example 1 (External forcing^. Consider the system: 

X = JH'{x) + f{t) (29) 

where the Hamiltonian H is (0, 6(x))-subquadratic, and the forcing term is a 
distribution on the circle: 

f=-^F + fo with FgL^ {m./T2Z- IR^") , (30) 
where := f{t)dt. For instance, 

fit) =^Ski, (31) 
feeiN 

where Sk is the Dirac mass at t = fc and ^ G K^" is a constant, fits the pre- 
scription. This means that the system x = JH'(x) is being excited by a series 
of identical shocks at interval T. 

Definition 1. Let Aoo{t) and Boo{t) he symmetric operators m H^", depending 
continuously on t ^ [0,7"], such that Aoo{t) < Boo(^) for all t. 

A Borelian function H : [0,T] x H^" IR is called {A^,B^)-subquadratic 



at infinity if there exists a function N{t, x) such that: 

Hit, a;) = i {A^{t)x, x) + N{t, x) (32) 

Vt , N{t, x) is convex with respect to x (33) 

N{t,x) > n{\\x\\) with n(s)s~-^ -)■ +oo as s +oo (34) 

3c G IR : H{t, x)<^ iB^{t)x, x) + c \/x . (35) 



If Aoc{t) = aoci and B^{t) = b^ol, with < 6oo € IR, we shall say that 

H is (aQc,boo)-subquadratic at infinity. As an example, the function with 
1 < a < 2, is {0, e)-subquadratic at infinity for every e > 0. Similarly, the 
Hamiltonian 

H{t,x) = lk\\kf + \\xr (36) 
is {k, k + e)-subquadratic for every £ > 0. Note that, if k <0, it is not convex. 
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Notes and Comments. The first results on subharmonics were obtained by Rabi- 
nowitz in I1985[ who showed the existence of infinitely many subharmonics both 
in the subquadratic and superquadratic case, with suitable growth conditions 
on H' . Again the duality approach enabled Clarke and Ekeland in 1981, to treat 
the same problem in the convex-subquadratic case, with growth conditions on 
H only. 

Recently, Michalak and Tarantello (see Michalek, R., Tarantello, G. ll982l and 
Tarantello, G. I1983P have obtained lower bound on the number of subharmonics 
of period fcT, based on symmetry considerations and on pinching estimates, as 
in Sect. 5.2 of this article. 
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